a hybrid feature subset selection algorithm for analysis of high correlation proteomic data

نویسندگان
چکیده

pathological changes within an organ can be reflected as proteomic patterns in biological fluids such as plasma, serum, and urine. the surface-enhanced laser desorption and ionization time-of-flight mass spectrometry (seldi-tof ms) has been used to generate proteomic profiles from biological fluids. mass spectrometry yields redundant noisy data that the most data points are irrelevant features for differentiating between cancer and normal cases. in this paper, we have proposed a hybrid feature subset selection algorithm based on maximum-discrimination and minimum-correlation (mdmc) coupled with a peak scoring criteria. our algorithm has been applied to two independent seldi-tof ms datasets of ovarian cancer obtained from the nci-fda clinical proteomics databank. the proposed algorithm has used to extract a set of proteins as potential biomarkers in each dataset. we applied the linear discriminate analysis (lda) to identify the important biomarkers. the selected biomarkers have been able to successfully diagnose ovarian cancer patients from non-cancer control group with accuracy of 100%, sensitivity of 100%, and specificity of 100% in the two datasets. the hybrid algorithm has the advantage that increase reproducibility of selected biomarkers and able to find a small set of proteins with high discrimination power.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Hybrid Feature Subset Selection Algorithm for Analysis of High Correlation Proteomic Data

Pathological changes within an organ can be reflected as proteomic patterns in biological fluids such as plasma, serum, and urine. The surface-enhanced laser desorption and ionization time-of-flight mass spectrometry (SELDI-TOF MS) has been used to generate proteomic profiles from biological fluids. Mass spectrometry yields redundant noisy data that the most data points are irrelevant features ...

متن کامل

A New Hybrid Feature Subset Selection Algorithm for the Analysis of Ovarian Cancer Data Using Laser Mass Spectrum

Introduction: Amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. The chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. Laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. A major challenge in extracting such ...

متن کامل

a new hybrid feature subset selection algorithm for the analysis of ovarian cancer data using laser mass spectrum

introduction: amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. the chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. a major challenge in extracting such ...

متن کامل

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

Constrained Based Feature Subset Selection Algorithm for High Dimensional Data

Feature Selection is to selecting the useful features from the original dataset for improve the more accurate results. Constrained Based Feature Subset Selection(CFSS) Algorithm Removes irrelevant and redundant features. This method is to find a similarity computation based on the entropy and conditional entropy values. After computing similarity computation to applied Approximate Relevancy(AR)...

متن کامل

A Binary Pso-aco Hybrid Algorithm for Feature Subset Selection

Feature Selection is the process of selecting a subset of features available, allowing a certain objective function to be optimized, from the data containing noisy,irrelevant and redundant features. This paper presents a novel feature selection method that is based on hybridization of ACO with a binary PSO to obtain excellent properties of two algorithms by synthesizing them and aims at achievi...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید


عنوان ژورنال:
journal of medical signals and sensors

جلد ۲، شماره ۳، صفحات ۰-۰

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023